Awkward Intelligence by Katharina A. Zweig

Awkward Intelligence by Katharina A. Zweig

Author:Katharina A. Zweig [Zweig, Katharina]
Language: eng
Format: epub
Publisher: MIT Press


Figure 33

Between 2010 and 2017, the error rate for image recognition by AI fell from 28 percent to just over 2 percent (results from the ILSVRC, Classification and Localization Division; see https://image-net.org/challenges/LSVRC/index.php).

If your mind works anything like mine does, at this point you might ask whether this sort of approach doesn’t make things pretty easy on the computer. If a computer thinks it detects (a) a sunset, (b) a face, (c) a toothbrush, (d) a fish, and (e) a television in an image, and one of them is correct, did it really win the contest? It wasn’t all that simple for the machine, however. Many images contain more than one object, and it is often unclear which is supposed to be the main one; it’s not always in the middle or taking up the most space.

If you’d like to see the images for yourself, there’s a list of the thousand categories that were supposed to be learned at http://image-net.org/challenges/LSVRC/2012/browse-synsets, all of which come from a larger data set called ImageNet. Each category in ImageNet contains a different number of images, some of which are ridiculously difficult. The category Dungeness Crab, for example, sets an image of a living specimen next to a beautifully arranged, neatly carved up crab where all that’s missing is a squeeze of lemon. In another image, a fairly pale tourist stands half-naked on a beach holding up two discoveries that are only marginally visible.3

If you click through the images, you’ll find that some have been horribly miscategorized. The category Foot Ruler, for example—a ruler that is a foot long—also includes a series of images of feet with no ruler at all, but which are evidently a foot long. The folder also contains a tremendous number of images of fish laid out along a ruler, probably to measure them for a contest. The ruler itself is barely visible. In this case, you would be entirely justified in doubting whether this image was assigned the right central category, especially when anyone can see it’s a fish—pardon me, an American brown trout with an auditory canal on its left-hand side and a spotted tongue. As everyone knows.

Against this backdrop, all you can do is tip your cap to what machines have managed to accomplish with image recognition. When you consider that they are spotting objects that are themselves hidden and may not even play the leading role in an image, and what’s more may contain highly specific categories like different dog breeds, it’s a wonder that machines now only classify 2.5 percent of images incorrectly. In 2010 it was still at 28 percent.

But wait a minute—I still owe you a comparison to human labelers, don’t I? I’m sure you’re imagining that a small army of testers were recruited to solve the same task as the machines. For each image testers would select five potential matching categories, after first having learned the thousand categories possible by heart. But who’s going to learn a thousand categories by heart? Good question! The



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.